ANFIS-based Fuzzy Systems for Searching DNA-Protein Binding Sites

نویسندگان

  • Dianhui Wang
  • Monther Alhamdoosh
  • Witold Pedrycz
چکیده

Transcriptional regulation mainly controls how genes are expressed and how cells behave based on the transcription factor (TF) proteins that bind upstream of the transcription start sites (TSSs) of genes. These TF DNA binding sites (TFBSs) are usually short (5-15 base pairs) and degenerate (some positions can have multiple possible alternatives). Traditionally, computational methods scan DNA sequences using the position weight matrix (PWM) of a given TF, calculate binding scores for each K-mer against the PWM, and finally classify a K-mer as to whether it is a putative TFBS or a background sequence based on a cut-off threshold. The FSCAN system, which is proposed in this paper, employs machine learning techniques to build a learner model that is able to identify TFBSs in a set of bound sequences without the need for a cut-off threshold. Our proposed method utilizes fuzzy inference techniques along with a distribution-based filtering algorithm to predict the binding sites of a TF given its PWM model and phastCons scores for the input DNA sequences. Data imbalance reduction techniques are also used to ease the learning of the adaptive-neuro fuzzy inference system (ANFIS) algorithm. The proposed system is tested on 22 ChIP-chip sequence-sets from the Saccharomyces Cerevisiae genome. Our results show that FSCAN outperforms other approaches like MatInspector and MATCH and is quite robust. As more transcriptional data becomes available, our proposed framework encourages the use of fuzzy logic techniques in the prediction of TFBSs. Contact: [email protected] keywords: Similarity Metric, Transcription Factor Binding Sites, Position Weight Matrix, Motif Discovery, Random Positions, Positional Affinity

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Novel Small Molecules against Two Binding Sites of Wnt2 Protein as potential Drug Candidates for Colorectal Cancer: A Structure Based Virtual Screening Approach

Wnts are the major ligands responsible for activating Wnt signaling pathway through binding to Frizzled proteins (Fzd) as the receptors. Among these ligands, Wnt2 plays the main role in the tumorigenesis of several human cancers especially colorectal cancer (CRC). Therefore, it can be considered as a potential drug target.The aim of this study was to identify potential drug candidates ...

متن کامل

Novel Small Molecules against Two Binding Sites of Wnt2 Protein as potential Drug Candidates for Colorectal Cancer: A Structure Based Virtual Screening Approach

Wnts are the major ligands responsible for activating Wnt signaling pathway through binding to Frizzled proteins (Fzd) as the receptors. Among these ligands, Wnt2 plays the main role in the tumorigenesis of several human cancers especially colorectal cancer (CRC). Therefore, it can be considered as a potential drug target.The aim of this study was to identify potential drug candidates ...

متن کامل

Reliability and Sensitivity Analysis of Structures Using Adaptive Neuro-Fuzzy Systems

In this study, an efficient method based on Monte Carlo simulation, utilized with Adaptive Neuro-Fuzzy Inference System (ANFIS) is introduced for reliability analysis of structures. Monte Carlo Simulation is capable of solving a broad range of reliability problems. However, the amount of computational efforts that may involve is a draw back of such methods. ANFIS is capable of approximating str...

متن کامل

Voting Algorithm Based on Adaptive Neuro Fuzzy Inference System for Fault Tolerant Systems

some applications are critical and must designed Fault Tolerant System. Usually Voting Algorithm is one of the principle elements of a Fault Tolerant System. Two kinds of voting algorithm are used in most applications, they are majority voting algorithm and weighted average algorithm these algorithms have some problems. Majority confronts with the problem of threshold limits and voter of weight...

متن کامل

Voting Algorithm Based on Adaptive Neuro Fuzzy Inference System for Fault Tolerant Systems

some applications are critical and must designed Fault Tolerant System. Usually Voting Algorithm is one of the principle elements of a Fault Tolerant System. Two kinds of voting algorithm are used in most applications, they are majority voting algorithm and weighted average algorithm these algorithms have some problems. Majority confronts with the problem of threshold limits and voter of weight...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016